Abstract: Space Division Multiple Access (SDMA) based
Medium Access Control (MAC) protocols have been proposed to
enable concurrent communications and improve link throughput
in Multi-Input Multi-Output (MIMO) Ad Hoc networks. For the
most part, the works appearing in the literature make idealized
and simplifying assumptions about the underlying physical layer
as well as some aspects of the link adaptation protocol. As a
result, the performance predicted by such works may not be a
good predictor of actual performance in a fully deployed
system. In this paper we introduce elements into the SDMA MAC
concept that allow us to better predict its performance under
realistic operating conditions. Using a generic SDMA-MAC, we
look at how the network sum throughput changes with the
introduction of the following: (a) use of the more practical
MMSE algorithm instead of the zero-forcing or SVD based nulling
algorithms used for receive beamnulling; (b) impact of channel
estimation errors; (c) introduction of a link adaptation
mechanism specifically designed for concurrent SDMA MACs;
(d) incorporation of TX beamforming along with RX beamnulling.
Focusing on the transmission window during which concurrent
transmissions are allowed by the MAC, we quantify the impact of
each of these four elements in isolation. Finally, the
performance of a system that incorporates elements (a) through
(d) is presented and compared against the baseline system,
showing an improvement of up to 5x in the overall network sum
throughput.

Index Terms: Multi-Input Multi-Output (MIMO), Space Division
Multiple Access (SDMA), Medium Access Control (MAC),
Concurrent Communications.



I. Introduction 

Networks of MIMO (Multi-Input Multi-Output) enabled 
nodes can use advanced eigen-beamforming and beamnulling 
techniques to enable concurrent communications and increase 
overall network throughput. This technique is loosely referred 
to as space division multiple access and several medium access 
control (MAC) protocols have appeared in the literature that 
can deliver concurrent transmissions in an Ad Hoc network of 
multi-antenna, MIMO-enabled nodes [1]-[5].

Although SDMA (space division multiple access) and concurrent
links have been well studied in cellular networks (see [6] and
the references therein), they remain a challenging problem
in Ad Hoc networks. Initially, SDMA and concurrent links 
were utilized in Ad Hoc networks via a simple abstract model 
called Degree of Freedom (DOF) Q, ||8). This model uses 
the number of antennas to represent the number of concurrent
links in the network. It assumes that the concurrent links are 
perfectly separated and do not interfere with one another. 
As such, the DOF model ignores all physical layer (PHY) 
impairments. At the same time, using TX/RX beamforming,
the SPACEMAC, MIMAMAC and NullHoc protocols [1]-[5]
have been proposed to support concurrent links in Ad Hoc
networks. These protocols assume that the first node to "win"
the contention window will use an omnidirectional radiation
pattern, while the other, secondary, users will use TX
beamforming to ensure that a newly accessing link does not
introduce any interference to existing links. As a result, the
throughput of existing links is not affected, and additional
network throughput is gained from the newly formed concurrent
links. Although
this idea works well under an ideal MIMO system model, 
its performance is significantly affected by physical layer 
constraints and imperfections (e.g., channel estimation error, 
absence of link adaptation, etc.). In this paper we focus on 
the concurrent transmission window within a generic SDMA- 
MAC that is similar in its construction to SPACEMAC and 
NullHoc. These MACs were proposed for Ad Hoc networks 
and typically use signaling during the contention window to 
determine the TX & RX beam patterns to be used during 
the concurrent transmission windows. They typically make 
idealized and simplified assumptions that will impact their 
performance during the concurrent transmission window. In 
this paper our aim is to migrate an idealized SDMA-MAC
system, such as the ones found in [1]-[4], towards a more
realistic one that incorporates (a) channel estimation error,
(b) the use of a more practical MMSE detection algorithm,
(c) incorporation of link adaptation, and (d) combined TX
and RX beamforming techniques. Our study uses the generic
SDMA-MAC protocol presented in section II as a baseline,
but the results could be easily extended to other MACs with
a similar structure. For each of the elements (a) through
(d) we compare the performance of the baseline SDMA-MAC
during the concurrent transmission window with and
without the proposed modification. We then combine all the
changes together and compare the performance of the resulting
"practical" MAC with the baseline system. The remainder of
the paper is organized as follows. Section II introduces our
system model, the baseline SDMA-MAC, and the simulation
setup. Section III describes each of the four elements of
our proposed modifications and the associated performance
gain/loss for each element in isolation. In section IV we
provide side by side comparisons of the baseline concurrent
SDMA-MAC, the realistic variant of the concurrent SDMA-MAC
which includes elements (a) through (d), and a non-concurrent
MAC that utilizes MIMO links. The paper is concluded in
Section V.

II. System Description 

This paper focuses on a single hop Ad Hoc network,
where each node is within the transmission range of all other
nodes. There are a total of $K$ concurrent links simultaneously
transmitting in the network, labeled as link $L_1$ to link $L_K$.
The TX node and RX node involved in link $L_q$ are denoted
as $T_q$ and $R_q$, respectively. Every node is equipped with $N_A$
antennas, and all packets are modulated using OFDM (Orthogonal
Frequency Division Multiplexing), where the number of
subcarriers is $N_C$. The TX power per node is the same and is
denoted by $P_T$. The fast fading channel from the TX node $T_q$
to the RX node $R_q$ at the $i$th subcarrier is $H_{R_q,T_q}(i)$, which
is an $N_A \times N_A$ matrix of complex Gaussian random variables
with zero mean and unit variance. $G_{R_q,T_q}$ is the path loss
from node $T_q$ to node $R_q$. For simplicity, we assume that each
link uses a single spatial stream. However, our discussions can
be easily generalized to other cases where some links might
use multiple spatial streams. We denote the power normalized
$N_A \times 1$ TX vector at the $i$th subcarrier of node $T_q$ as $W_{T_q}(i)$
(power normalized implies that $W_{T_q}^H(i) W_{T_q}(i) = 1$). Similarly,
the RX vector at the $i$th subcarrier of node $R_q$ is $W_{R_q}(i)$,
subject to $W_{R_q}^H(i) W_{R_q}(i) = 1$. Also, the transmitted QAM
symbol at each subcarrier is zero mean and unit variance.
Finally, we use $A(i)$ to represent the matrix corresponding
to the $i$th subcarrier, and $A(i,j)$ is the $j$th column of matrix
$A(i)$. $[\cdot]^H$ and $[\cdot]^T$ denote the Hermitian (conjugate
transpose) and transpose operations, respectively.

A. Overview of the Generic SDMA-MAC 

Our baseline SDMA-MAC is designed to represent a class
of MACs such as SPACEMAC and NullHoc [1]-[4]. It is built
on the principle that links have an access hierarchy, in that
a newly accessing link should cause no interference
to the existing links. This process is described mathematically
in the following subsection.

B. Mathematical Description of a Generic SDMA-MAC 

Let the access order of the $K$ concurrent links in the network
range from $L_1$ (link 1) to $L_K$ (link $K$). Suppose the first
$(Q-1)$ links have accessed the channel, and we now look at
the access process of link $L_Q$, $1 < Q \le K$. According to our
generic SDMA-MAC protocol, during the concurrent transmission
window, link $L_Q$ should use a TX vector, $W_{T_Q}(i)$,
that is orthogonal to the existing links' RX vectors $W_{R_q}(i)$,
or equivalently:

$$ \sqrt{P_T G_{R_q,T_Q}/N_C}\; W_{R_q}^H(i)\, H_{R_q,T_Q}(i)\, W_{T_Q}(i) = 0, \quad 1 \le q \le (Q-1). \tag{1} $$

To calculate the TX vector $W_{T_Q}(i)$, we start with
$H_{intf,T_Q}(i)$, which represents all interference channels from
node $T_Q$ to nodes $(R_1, R_2, \ldots, R_{Q-1})$. $H_{intf,T_Q}(i)$ is an
$N_A \times (Q-1)$ matrix, whose $q$th column is:

$$ H_{intf,T_Q}(i,q) = \left\{ \sqrt{P_T G_{R_q,T_Q}/N_C}\; W_{R_q}^H(i)\, H_{R_q,T_Q}(i) \right\}^H, \quad 1 \le q \le (Q-1). \tag{2} $$



Node $T_Q$ then runs a Singular Value Decomposition (SVD)
on $H_{intf,T_Q}(i) H_{intf,T_Q}^H(i)$. Assuming non-increasing order of
eigenvalues in the SVD result, node $T_Q$'s TX vector is
calculated as:

$$ H_{intf,T_Q}(i)\, H_{intf,T_Q}^H(i) = U_{T_Q}(i)\, \Lambda_{T_Q}(i)\, U_{T_Q}^H(i), \tag{3} $$
$$ W_{T_Q}(i) = U_{T_Q}(i, N_A). \tag{4} $$

Here $U_{T_Q}(i, N_A)$ is the $N_A$th column of the matrix $U_{T_Q}(i)$.
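As a rough illustration of the orthogonality condition in Eqn. (1), the sketch below builds a unit-norm TX vector that is orthogonal to every column of the interference matrix of Eqn. (2) on a single subcarrier. It is not the SVD procedure of Eqns. (3)-(4): it swaps in a plain Gram-Schmidt projection, which returns some vector in the same null space rather than the particular eigenvector selected by the SVD. The function names, the random test channels, and the assumption that fewer than $N_A$ links are already active (so that an exact null exists) are ours and do not come from the protocols under study.

```python
import random

def inner(a, b):
    """Hermitian inner product a^H b of two complex vectors."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def normalize(v):
    """Scale v to unit norm."""
    norm = abs(inner(v, v)) ** 0.5
    return [x / norm for x in v]

def remove_components(v, basis):
    """Subtract from v its projections onto the orthonormal vectors in basis."""
    v = list(v)
    for b in basis:
        coeff = inner(b, v)
        v = [x - coeff * y for x, y in zip(v, b)]
    return v

def nulling_tx_vector(interference_columns, seed_vector):
    """Unit vector w with h^H w = 0 for every interference column h.

    Assumes fewer columns than antennas; otherwise no exact null exists.
    """
    basis = []
    for col in interference_columns:
        residual = remove_components(col, basis)
        if abs(inner(residual, residual)) > 1e-12:
            basis.append(normalize(residual))
    return normalize(remove_components(seed_vector, basis))

def random_complex_vector(rng, n):
    """Hypothetical helper: n i.i.d. zero-mean, unit-variance complex Gaussians."""
    s = 0.5 ** 0.5
    return [complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(n)]

rng = random.Random(0)
N_A = 4  # antennas per node
columns = [random_complex_vector(rng, N_A) for _ in range(2)]  # Q - 1 = 2 active links
w_tx = nulling_tx_vector(columns, random_complex_vector(rng, N_A))
leakage = [abs(inner(h, w_tx)) for h in columns]  # numerically zero
```

Any seed vector that is not already inside the span of the interference columns works here; the choice of seed is exactly the freedom that a TX beamforming step could exploit.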
Next we calculate $W_{R_Q}(i)$ based on both the desired channel
coming from node $T_Q$ and the interference channels coming
from nodes $(T_1, T_2, \ldots, T_{Q-1})$. Here the interference channel
from $T_q$ is denoted as
$\sqrt{P_T G_{R_Q,T_q}/N_C}\, H_{R_Q,T_q}(i) W_{T_q}(i)$
with $1 \le q \le (Q-1)$. The desired channel from $T_Q$ is
$\sqrt{P_T G_{R_Q,T_Q}/N_C}\, H_{R_Q,T_Q}(i) W_{T_Q}(i)$. Node $R_Q$'s RX vector
$W_{R_Q}(i)$ can be derived using a MIMO detection algorithm
(zero-forcing is used in the generic SDMA-MAC). Details
relating to link contention, handshaking, and channel information
exchange can be found in [1]-[4]. These are not considered
here, as the focus of this work is the achievable real-world
performance of the concurrent SDMA-MAC during the data
transmission phase of the protocol. Admittedly, the structure
and mechanism of the contention windows, RTS, CTS, etc.,
will impact the overall performance of the network. However,
in the interest of maintaining focus we have chosen to defer
these issues to a possible follow-on contribution. Therefore,
care must be taken to incorporate all MAC-specific overheads
when translating the results to estimate MAC efficiency or
throughput performance.

At this juncture it is worth introducing some underlying
assumptions or limitations in our generic SDMA-MAC which
also appear in SPACEMAC and NullHoc [1]-[4].

1) SDMA-MAC often assumes perfect channel estimation 
in the system (T), G). 

2) TX and RX vectors are calculated using the zero-forcing
or SVD based algorithm [1]-[4].

3) Each link simply uses a fixed modulation scheme, 
specifically, the 2Mbps mode in 802.11b. Multi-rate 
capabilities embedded within the concurrent links are 
not fully utilized. 

4) TX vectors are calculated to minimize the resultant 
interference on the existing links. But the optimization 
of SNR within the desired communication is not con- 
sidered. 

5) Simulations in SPACEMAC and NullHoc consider at
most $N_A$ concurrent links, and the capability of
supporting more than $N_A$ links is not evaluated.

C. Description of the Simulation Environment and Metric 
Used 

Our simulations are conducted in a single-hop Ad Hoc
network, where all concurrent links are randomly and uniformly
placed in a 200 m by 200 m square area. Each
node is equipped with $N_A = 4$ antennas, and uses a single
spatial stream. The system bandwidth is $W = 20$ MHz.
The modulation is OFDM with $N_C = 64$ subcarriers and
a guard interval of 1/4.
We assume no power control in the network, with the total
TX power per node $P_T = 25$ dBm. Power decay between any
two nodes is calculated according to the simplified path loss
model [7] with an exponent of 3, $d_0 = 1$ m, and wavelength
$\lambda = 0.125$ m. Fast fading Rayleigh channels are kept invariant
during the transmission period. Background noise power per
subcarrier is $\sigma_N^2 = -113$ dBm. When link adaptation is
enabled, the link can pick one of the eight modulation and
coding schemes (MCS) shown in Table I. Also, all packets
carry the same amount of data, namely, 100 bytes. We use
MATLAB software to build our simulation framework, and
each point in our results is an average of 1000 independently
generated topologies.

TABLE I
List of Modulation and Coding Schemes

MCS Index | Modulation | Coding Rate | Minimum required effective PPSNR for the target BER/PER (10% PER)
0 | BPSK | 1/2 | 1.4 dB
1 | QPSK | 1/2 | 4.4 dB
2 | QPSK | 3/4 | 6.5 dB
3 | 16QAM | 1/2 | 8.6 dB
4 | 16QAM | 3/4 | 12 dB
5 | 64QAM | 2/3 | 15.8 dB
6 | 64QAM | 3/4 | 17.2 dB
7 | 64QAM | 5/6 | 18.8 dB
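To give a feel for the link budget implied by this setup, the following sketch evaluates a simplified path loss model with the stated exponent, reference distance, and wavelength. The paper does not spell out the gain at the reference distance; the free-space value $(\lambda / (4\pi d_0))^2$ used here is our assumption, as are the function names. The result is an average single-antenna SNR per subcarrier that ignores fading and any beamforming gain.

```python
import math

def path_gain_db(distance_m, wavelength_m=0.125, d0_m=1.0, exponent=3.0):
    """Path gain in dB (negative values mean loss) for distance >= d0.

    Assumption: the gain at the reference distance d0 is the free-space
    value (wavelength / (4*pi*d0))**2.
    """
    ref_gain_db = 20.0 * math.log10(wavelength_m / (4.0 * math.pi * d0_m))
    return ref_gain_db - 10.0 * exponent * math.log10(distance_m / d0_m)

def per_subcarrier_snr_db(distance_m, tx_power_dbm=25.0, n_subcarriers=64,
                          noise_dbm=-113.0):
    """Average single-antenna SNR per subcarrier, ignoring fading and array gain."""
    rx_dbm = tx_power_dbm + path_gain_db(distance_m)
    # TX power is split evenly across the subcarriers (the P_T / N_C factor).
    rx_per_subcarrier_dbm = rx_dbm - 10.0 * math.log10(n_subcarriers)
    return rx_per_subcarrier_dbm - noise_dbm

snr_at_100m = per_subcarrier_snr_db(100.0)
```

Under these assumptions a 100 m link sits at roughly 20 dB per subcarrier before fading, which suggests that in a 200 m by 200 m area interference rather than noise tends to limit the concurrent links.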

We use the network sum-throughput metric [8], [9] in our
study, which measures the successfully transmitted throughput
summed over all links. To decide whether a
particular link is viable, we calculate the effective
Post Processing SNR (PPSNR) [10] at the receiver for each
link. For a given MCS, if the effective PPSNR is above the
minimum required for the desired QoS (see Table I), then we
declare the link viable and include the link throughput within
the sum-throughput calculation. Otherwise, the link is assumed
not to be usable. The effective PPSNR is calculated as follows.
Once the TX vector $W_{T_Q}(i)$ and the RX vector $W_{R_Q}(i)$ have
been determined for all links in the network, the PPSNR can
be calculated as:

$$ \Gamma_{R_Q}(i) = \frac{\left| W_{R_Q}^H(i)\, H^{Rec}_{R_Q,T_Q}(i) \right|^2}{\sum_{q=1,\, q \neq Q}^{K} \left| W_{R_Q}^H(i)\, H^{Rec}_{R_Q,T_q}(i) \right|^2 + \sigma_N^2}, \tag{5} $$

$$ H^{Rec}_{R_Q,T_q}(i) = \sqrt{P_T G_{R_Q,T_q}/N_C}\; H_{R_Q,T_q}(i)\, W_{T_q}(i). \tag{6} $$

Using the PPSNR $\Gamma_{R_Q}(i)$ and $\Gamma_{R_Q,dB}(i) = 10 \log_{10}(\Gamma_{R_Q}(i))$,
we then calculate link $L_Q$'s effective
PPSNR $\Gamma^{eff}_{R_Q,dB}$ via Eqn. (7) [10] as

$$ \Gamma^{eff}_{R_Q,dB} = \frac{1}{N_C} \sum_{i=1}^{N_C} \Gamma_{R_Q,dB}(i) - \alpha \cdot \mathrm{var}\left[ \Gamma_{R_Q,dB}(i) \right]. \tag{7} $$

Here the variance var is calculated over all subcarriers, and
$\alpha = 0.125$ is fitted offline [10].
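The effective PPSNR of Eqn. (7) can be sketched in a few lines. The example below is illustrative only: the function name is ours, and we assume that "var" is the population variance over the subcarriers, which the text does not state explicitly.

```python
import math
from statistics import pvariance

def effective_ppsnr_db(ppsnr_linear, alpha=0.125):
    """Mean per-subcarrier PPSNR in dB minus alpha times its variance (Eqn. (7)).

    Assumption: 'var' is the population variance over the subcarriers.
    """
    ppsnr_db = [10.0 * math.log10(g) for g in ppsnr_linear]
    mean_db = sum(ppsnr_db) / len(ppsnr_db)
    return mean_db - alpha * pvariance(ppsnr_db)

# Two links with the same average PPSNR in dB (10 dB) across 64 subcarriers.
flat = effective_ppsnr_db([10.0] * 64)             # frequency-flat link
selective = effective_ppsnr_db([1.0, 100.0] * 32)  # alternating 0 dB and 20 dB
```

The frequency-flat link keeps its full 10 dB, while the strongly selective link is penalized by the variance term, so the metric favors links whose subcarriers are uniformly good.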

III. Quantifying Performance with Realistic 
Parameters and Algorithms 

This section is broken down into four subsections. In these
subsections we separately look at the impact of MMSE,
channel estimation error, link adaptation, and TX beamforming
on the sum-throughput of our generic SDMA-MAC. These
are studied in isolation from one another. Section IV
then evaluates the SDMA-MAC that incorporates all four
elements.



A. MMSE vs. ZF 

In this section we look at the impact of using the more
common MMSE (minimum mean squared error) detector
instead of the idealized ZF (zero-forcing) detector assumed in
the class of SDMA-MACs [1]-[4]. The MMSE
detector is more common because it has the same hardware
complexity as the ZF detector, but does not suffer from the
unwanted noise enhancement properties of the latter [11].
Additionally, one of the drawbacks of the MACs presented in
[1]-[4] is that channel access is sequential, with the current link
not knowing anything about other links that might access the
channel after it. In this section we also consider the
potential benefits of relaxing this assumption. We will refer to
this scheme as the Universal-MMSE scheme and describe it
in subsection 3).

1) Zero-Forcing Detection: Using the same notation as in
section II, link $L_Q$'s RX vector under the ZF criterion is
expressed as:

$$ W_{R_Q}(i) = N\left\{ B_{R_Q}(i) \left[ B_{R_Q}^H(i)\, B_{R_Q}(i) \right]^{-1} e_1 \right\}, \tag{8} $$
$$ e_1 = [\underbrace{1\ \ 0\ \ \cdots\ \ 0}_{Q \text{ elements}}]^T. \tag{9} $$

Here $B_{R_Q}(i) = \left[ H^{Rec}_{R_Q,T_Q}(i),\ H^{Rec}_{R_Q,T_1}(i),\ \ldots,\ H^{Rec}_{R_Q,T_{Q-1}}(i) \right]$
is an $N_A \times Q$ matrix, and $H^{Rec}_{R_Q,T_q}(i)$ is given in Eqn. (6).
$N\{\cdot\}$ denotes vector normalization to unit power. With
the expression for $W_{R_Q}(i)$, we then place it in the equations
for PPSNR (Eqn. (5)-(7)) to determine if a given link is active or
not. From there we calculate the sum throughput as described
in section II.C.

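2) MMSE Detection: Link $L_Q$'s RX vector under the
MMSE criterion is expressed as:

$$ W_{R_Q}(i) = N\left\{ C^{-1}_{R_Q,MMSE}(i) \cdot H^{Rec}_{R_Q,T_Q}(i) \right\}, \tag{10} $$

$$ C_{R_Q,MMSE}(i) = \sum_{q=1}^{Q-1} H^{Rec}_{R_Q,T_q}(i) \left\{ H^{Rec}_{R_Q,T_q}(i) \right\}^H + \sigma_N^2 I_{N_A}. \tag{11} $$

Here $H^{Rec}_{R_Q,T_q}(i)$ is given in Eqn. (6). Similar to the ZF receiver,
the derived $W_{R_Q}(i)$ is used in the PPSNR calculation (Eqn.
(5)-(7)) and subsequently in the sum-throughput calculation.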
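The MMSE RX vector can be sketched for a single subcarrier as follows: form the interference-plus-noise covariance, solve a linear system for the unnormalized vector, normalize it, and evaluate the resulting PPSNR. This is a minimal sketch under our own assumptions: the small Gaussian elimination solver, the helper names, the random channels, and the noise level are illustrative and are not taken from the simulation framework used in this paper. A matched-filter vector is included only as a point of comparison.

```python
import random

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [list(A[r]) + [b[r]] for r in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0j] * n
    for r in reversed(range(n)):
        x[r] = (M[r][n] - sum(M[r][c] * x[c] for c in range(r + 1, n))) / M[r][r]
    return x

def covariance(interferers, noise_var, n):
    """Interference-plus-noise covariance: sum of h h^H plus noise_var * I."""
    return [[sum(h[r] * h[c].conjugate() for h in interferers)
             + (noise_var if r == c else 0.0)
             for c in range(n)] for r in range(n)]

def mmse_rx_vector(desired, interferers, noise_var):
    """Unit-norm MMSE RX vector, proportional to C^{-1} h_desired."""
    w = solve(covariance(interferers, noise_var, len(desired)), desired)
    norm = abs(inner(w, w)) ** 0.5
    return [x / norm for x in w]

def ppsnr(w, desired, interferers, noise_var):
    """Post-processing SNR of a unit-norm RX vector w."""
    interference = sum(abs(inner(w, h)) ** 2 for h in interferers)
    return abs(inner(w, desired)) ** 2 / (interference + noise_var)

def random_channel(rng, n):
    """Hypothetical helper: n i.i.d. unit-variance complex Gaussian entries."""
    s = 0.5 ** 0.5
    return [complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(n)]

rng = random.Random(1)
h_desired = random_channel(rng, 4)
h_interf = [random_channel(rng, 4) for _ in range(2)]
noise_var = 0.1
w_mmse = mmse_rx_vector(h_desired, h_interf, noise_var)
w_matched = [x / abs(inner(h_desired, h_desired)) ** 0.5 for x in h_desired]
gain = (ppsnr(w_mmse, h_desired, h_interf, noise_var)
        / ppsnr(w_matched, h_desired, h_interf, noise_var))
```

Because the MMSE vector maximizes the post-processing SNR among all linear receivers for the interferers it knows about, `gain` is never below one; how far above one it lands depends on the channel draw.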

3) Universal MMSE: Previous derivations for link $L_Q$'s RX
vector only consider the interference channels coming from links
$L_1$ to $L_{Q-1}$. The residual interference caused by $L_{Q+1}$ to
$L_K$ is not considered. This can have a negative impact on the
PPSNR results. Here we look to answer the question of how
much the performance of the system might be improved if
the receive beamnulling was performed at each node with full
knowledge of all $K$ transmitters. Firstly, link $L_Q$ estimates the
covariance of the received signal from all $K$ TX nodes as:

$$ C_{R_Q,UMMSE}(i) = \sum_{q=1,\, q \neq Q}^{K} H^{Rec}_{R_Q,T_q}(i) \left\{ H^{Rec}_{R_Q,T_q}(i) \right\}^H + \sigma_N^2 I_{N_A}. \tag{12} $$

Based on the estimate $C_{R_Q,UMMSE}(i)$, the corresponding
Universal MMSE RX vector is:

$$ W_{R_Q}(i) = N\left\{ C^{-1}_{R_Q,UMMSE}(i) \cdot H^{Rec}_{R_Q,T_Q}(i) \right\}. \tag{13} $$

Again, $W_{R_Q}(i)$ is used to calculate the PPSNR (Eqn. (5)-(7))
and the sum-throughput.

Fig. 1. SDMA-MAC's throughput performance comparing a ZF, an MMSE,
and a Universal MMSE RX vector calculation algorithm.

4) Simulation Results: We now simulate the SDMA-MAC
protocol using both the ZF and the MMSE algorithms for the
RX vector $W_{R_Q}(i)$. We assume perfect channel estimation,
and as in the case of [1]-[4] we fix the MCS for each link
to either MCS 0 or MCS 5. Fig. 1 shows the sum-throughput
as a function of the number of concurrent links allowed by
the MAC. As expected, the sum-throughput initially increases
with the number of concurrent links in the network. However,
as additional concurrent links are added, the interference
power dominates, thus causing a decrease in the network
sum-throughput. Fig. 1 also shows that the MMSE receiver
outperforms the ZF receiver by an average of 10%, and a
maximum of 20%. The Universal MMSE scheme outperforms
the ZF receiver by up to 40%. This is because for link $L_Q$,
the Universal MMSE protocol takes into account the residual
interference from all links irrespective of the order in which
they start transmission, whereas the ZF and MMSE solutions
only take into account the subset of links that accessed the
channel before link $L_Q$.

B. Impact of Channel Estimation Errors 

Channel estimation errors impact the sum throughput by
increasing interference and also reducing the PPSNR. Given
that the SDMA-MAC uses MIMO beamforming at both the
TX and the RX, estimation errors will impact both the TX
vector $W_{T_Q}(i)$ and the RX vector $W_{R_Q}(i)$. Let us first
derive the expression of the noisy RX vector $\hat{W}_{R_Q}(i)$. Note
that this is calculated using either the ZF method (Eqn. (8))
or the MMSE method (Eqn. (10)-(11)). These calculations all
rely on the channel information $H^{Rec}_{R_Q,T_q}(i)$. Under
imperfect channel estimation, the noisy estimate of $H^{Rec}_{R_Q,T_q}(i)$
is given by:

$$ \hat{H}^{Rec}_{R_Q,T_q}(i) = \sqrt{P_T G_{R_Q,T_q}/N_C}\; H_{R_Q,T_q}(i)\, W_{T_q}(i) + \sqrt{\sigma_e^2}\; Z_{R_Q,T_q}(i), \quad 1 \le q \le K. \tag{14} $$

Here $Z_{R_Q,T_q}(i)$ represents the channel estimation noise, which
is modeled as complex Gaussian random variables with zero
mean and unit variance. $\sigma_e^2$ is the variance of the estimation
noise, which depends on the variance of the background
noise per subcarrier $\sigma_N^2$ (e.g., under $L$ training symbols and
least-square estimation, $\sigma_e^2$ is equal to $\sigma_N^2 / L$). In this
way, the noisy RX vector $\hat{W}_{R_Q}(i)$ is derived by using the noisy
estimate $\hat{H}^{Rec}_{R_Q,T_q}(i)$ rather than the perfect estimate $H^{Rec}_{R_Q,T_q}(i)$ in the
ZF or MMSE methods.
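The relation between training length and estimation noise, and the additive noise model of Eqn. (14), can be sketched as below. The function names and the example channel values are hypothetical; the only inputs taken from the paper are the -113 dBm noise floor and the $\sigma_N^2 / L$ rule for least-square estimation.

```python
import math
import random

def estimation_noise_dbm(noise_dbm, n_training_symbols):
    """Estimation-noise variance in dBm for least-squares estimation over L symbols."""
    return noise_dbm - 10.0 * math.log10(n_training_symbols)

def noisy_estimate(true_vector, sigma_e2, rng):
    """Add zero-mean complex Gaussian noise of variance sigma_e2 to each entry."""
    s = math.sqrt(sigma_e2 / 2.0)  # standard deviation per real/imaginary part
    return [h + complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for h in true_vector]

sigma_n2_mw = 10 ** (-113.0 / 10.0)  # background noise per subcarrier, in mW
sigma_e2_mw = sigma_n2_mw / 10       # L = 10 training symbols
h_true = [complex(1e-5, -2e-5), complex(-3e-5, 1e-5)]  # illustrative values only
h_hat = noisy_estimate(h_true, sigma_e2_mw, random.Random(0))
```

With $L = 10$ the estimation noise sits 10 dB below the background noise, which corresponds to an estimation variance of one tenth of $\sigma_N^2$.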

Now we look at the derivation of the noisy TX vector
$\hat{W}_{T_Q}(i)$. Since $W_{T_Q}(i)$ depends on $W_{R_q}(i)$ ($q = 1, \ldots, Q-1$),
the noisy estimate of $W_{T_Q}(i)$ will be a function
of the noisy estimates of the other $W_{R_q}(i)$. Recalling Eqn. (4),
$W_{T_Q}(i) = U_{T_Q}(i, N_A)$, we have that $W_{T_Q}(i)$ is a function of
the SVD results of $H_{intf,T_Q}(i) H_{intf,T_Q}^H(i)$. In the presence of
channel estimation errors, the column $H_{intf,T_Q}(i,q)$ of $H_{intf,T_Q}(i)$
in Eqn. (2) will be replaced by the noisy estimate:

$$ \hat{H}_{intf,T_Q}(i,q) = \left\{ \sqrt{P_T G_{R_q,T_Q}/N_C}\; \hat{W}_{R_q}^H(i)\, H_{R_q,T_Q}(i) \right\}^H + \sqrt{\sigma_e^2}\; Z_{T_Q,R_q}(i). \tag{15} $$

Again, here $Z_{T_Q,R_q}(i)$ represents the channel estimation noise.
Using the noisy estimate $\hat{H}_{intf,T_Q}(i,q)$, the noisy TX vector
$\hat{W}_{T_Q}(i)$ is calculated according to Eqn. (3)-(4).

Finally, given the resulting noisy TX vectors and RX
vectors, the effective PPSNR can be derived using Eqn. (5)-(7),
and the sum-throughput performance can be evaluated
accordingly.

1) Simulation Results: We simulate the sum throughput of
the SDMA-MAC protocol under different channel estimation
errors. Here the variance of the estimation error is set to $\sigma_N^2$,
$0.5\sigma_N^2$, $0.1\sigma_N^2$, $0.01\sigma_N^2$ and $0.001\sigma_N^2$, respectively, where $\sigma_N^2$
denotes the power of the background noise per subcarrier
and is equal to -113 dBm in our study. Each link's MCS is
fixed as MCS 0 or MCS 5, and simulation results under
different numbers of concurrent links are plotted in Fig. 2.
The curves in the figure show that, compared with the result
under perfect channel estimation, the system's sum throughput
is seriously degraded when the estimation variance is $\sigma_N^2$ or
$0.5\sigma_N^2$. Meanwhile, even under estimation variances of $0.1\sigma_N^2$
and $0.01\sigma_N^2$, there still exists considerable performance loss
in the sum throughput. Generally, it is safe to assume that for
any Ad Hoc system, the estimation noise variance will be at
best $0.1\sigma_N^2$.

Fig. 2. Sum throughput performance of SDMA-MAC under different channel
estimation errors. Each link uses either MCS 0 or MCS 5.



Fig. 3. Sum throughput performance of SDMA-MAC under the usage of 
link adaptation. Assume perfect channel estimation and ZF based RX vector 
derivation. 



C. Impact of Link Adaptation 

1) Link Adaptation Design: There are 8 different MCSs
in this paper, and each link can adaptively select the proper
MCS based on the estimated PPSNR $\hat{\Gamma}_{R_Q}(i)$. The derivation
of $\hat{\Gamma}_{R_Q}(i)$ is based on Eqn. (5) but using the estimated channel
information as shown in Eqn. (14). To study link adaptation
in isolation, this subsection assumes perfect channel estimates
($\sigma_e^2 = 0$). Given $\hat{\Gamma}_{R_Q,dB}(i) = 10 \log_{10}[\hat{\Gamma}_{R_Q}(i)]$, link $L_Q$'s
effective SNR is estimated as:

$$ \hat{\Gamma}^{eff}_{R_Q,dB} = \frac{1}{N_C} \sum_{i=1}^{N_C} \hat{\Gamma}_{R_Q,dB}(i) - 0.125 \cdot \mathrm{var}\left[ \hat{\Gamma}_{R_Q,dB}(i) \right] - \Gamma^{Backoff}_{R_Q,dB}. \tag{16} $$

Link $L_Q$ then selects the highest MCS whose threshold
listed in Table I is smaller than the estimated SNR $\hat{\Gamma}^{eff}_{R_Q,dB}$.
Finally, $\Gamma^{Backoff}_{R_Q,dB}$ in Eqn. (16) is a correction term that makes
up for the inaccuracy of the PPSNR estimation (e.g., due to
imperfect channel estimation). Its value can be tuned at run-time
using the real packet error rate embedded within the ACK
packet.
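The selection rule can be sketched as a simple table lookup. The thresholds below are copied from Table I; the function names, and the choice to pass the backoff as a separate argument instead of folding it into the effective SNR, are ours. The bits-per-symbol figures are the standard values for the listed constellations.

```python
# (modulation, coding rate, minimum effective PPSNR in dB), copied from Table I
MCS_TABLE = [
    ("BPSK", 1 / 2, 1.4),
    ("QPSK", 1 / 2, 4.4),
    ("QPSK", 3 / 4, 6.5),
    ("16QAM", 1 / 2, 8.6),
    ("16QAM", 3 / 4, 12.0),
    ("64QAM", 2 / 3, 15.8),
    ("64QAM", 3 / 4, 17.2),
    ("64QAM", 5 / 6, 18.8),
]
BITS_PER_SYMBOL = {"BPSK": 1, "QPSK": 2, "16QAM": 4, "64QAM": 6}

def select_mcs(effective_snr_db, backoff_db=0.0):
    """Highest MCS index whose threshold is below the backed-off SNR, else None."""
    usable = effective_snr_db - backoff_db
    best = None
    for index, (_, _, threshold_db) in enumerate(MCS_TABLE):
        if threshold_db < usable:
            best = index
    return best

def info_bits_per_subcarrier(index):
    """Information bits carried per subcarrier symbol for a given MCS index."""
    modulation, rate, _ = MCS_TABLE[index]
    return BITS_PER_SYMBOL[modulation] * rate
```

For example, `select_mcs(10.0)` returns 3 (16QAM, rate 1/2), and `select_mcs(1.0)` returns `None`, meaning the link is not viable at any rate. A link pinned to MCS 0 in the same 10 dB condition would carry 0.5 information bits per subcarrier symbol instead of 2.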

2) Simulation Results: We evaluate the sum throughput
performance of our generic SDMA-MAC by using the link
adaptation process discussed above. The results are summarized
in Fig. 3. Here we assume perfect channel estimation in
the system, and RX vectors are derived via the ZF method. For
completeness, we also provide the results of fixed MCS selection
(MCS 0 or MCS 5) in that figure. The resultant curves
underscore the importance of link adaptation in improving the
network performance. Compared with the fixed MCS selection
(MCS 0 or MCS 5) with a total of 4 concurrent links, the use
of link adaptation can provide additional throughput gains of
around 70% (for MCS 5) to 200% (for MCS 0).



D. Combining TX Beamforming with SDMA-MAC 

In the generic SDMA-MAC, the TX node $T_Q$ of link $L_Q$
knows nothing about the channel between TX node $T_Q$ and
RX node $R_Q$. In this subsection we pose the question of
how the performance could be improved if node $T_Q$ knew
about the channel between $T_Q$ and $R_Q$, and was able to
beamform accordingly. Consider the derivation of $W_{T_Q}(i)$ in
Eqn. (4). Provided that $Q \le N_A$, there will be $(N_A - Q + 1)$
candidates for the TX vector $W_{T_Q}(i)$, namely
$U_{T_Q}(i, Q), U_{T_Q}(i, Q+1), \ldots, U_{T_Q}(i, N_A)$, which all satisfy the
orthogonality condition of Eqn. (1). This follows because the
matrix in Eqn. (3) has rank at most $Q-1$, so its last
$N_A - Q + 1$ eigenvectors span its null space. Moreover, any linear
combination of these candidates is also orthogonal to the
existing links (Eqn. (1)).
This observation indicates that we can choose an optimized
linear combination of these candidates, so that the resultant
PPSNR in the desired communication is improved. We name
this scheme TX beamforming to distinguish it from the TX
beamnulling scheme (Eqn. (3)-(4)) used in the baseline SDMA-MAC.
We apply TX beamforming only to links $L_Q$ with
$Q \le N_A$; all other links (link $L_{N_A+1}$ to link $L_K$) will use the
default TX beamnulling of the SDMA-MAC.

1) TX Beamforming Calculation: Consider link $L_Q$ with
$Q \le N_A$. We use $U^{INIT}_{T_Q}(i)$ to denote all the TX vector
candidates at node $T_Q$. This is an $N_A \times (N_A - Q + 1)$
matrix composed of columns $U_{T_Q}(i, Q), U_{T_Q}(i, Q+1), \ldots, U_{T_Q}(i, N_A)$.
The resultant TX vector is given as
$W_{T_Q}(i) = U^{INIT}_{T_Q}(i) D_{T_Q}(i)$, where $D_{T_Q}(i)$ is an
$(N_A - Q + 1) \times 1$ column vector with $D_{T_Q}^H(i) D_{T_Q}(i) = 1$ representing
the linear combination of the columns of $U^{INIT}_{T_Q}(i)$. Given a specific $D_{T_Q}(i)$,
the calculated PPSNR at the $i$th subcarrier of link $L_Q$ under
the MMSE criterion is given as:

$$ \Gamma_{R_Q}(i) = \left( P_T G_{R_Q,T_Q}/N_C \right) \left\{ H_{R_Q,T_Q}(i)\, U^{INIT}_{T_Q}(i)\, D_{T_Q}(i) \right\}^H C^{-1}_{R_Q,MMSE}(i)\, H_{R_Q,T_Q}(i)\, U^{INIT}_{T_Q}(i)\, D_{T_Q}(i). \tag{17} $$

Here $C_{R_Q,MMSE}(i)$ is given in Eqn. (11). The optimal linear
combination vector $D_{T_Q}(i)$ can therefore be calculated as the
eigenvector of

$$ \left\{ H_{R_Q,T_Q}(i)\, U^{INIT}_{T_Q}(i) \right\}^H C^{-1}_{R_Q,MMSE}(i)\, H_{R_Q,T_Q}(i)\, U^{INIT}_{T_Q}(i) $$

that corresponds to the maximum eigenvalue. The
associated TX vector $W_{T_Q}(i)$ can then be calculated from
the optimal $D_{T_Q}(i)$ as $W_{T_Q}(i) = U^{INIT}_{T_Q}(i) D_{T_Q}(i)$.

Fig. 4. Sum throughput performance under our discussed TX beamforming
design (horizontal axis: the number of links in the network). Assume
perfect channel estimation in the network, and each link's
MCS is either fixed as MCS 0, or adaptively tuned.

Fig. 5. Sum throughput performance under different MAC schemes. Channel
estimation error is fixed as $0.1\sigma_N^2$.

2) Simulation Results: We evaluate the sum throughput
performance of our reference SDMA-MAC with the inclusion
of the TX beamforming approach combined with MMSE
RX vectors. By way of comparison we also provide the
throughput results of two other schemes: the first is the
baseline SDMA-MAC with ZF RX vectors (Eqn. (8)), and
the second is the baseline SDMA-MAC with MMSE RX
vectors (Eqn. (10)-(11)). We assume perfect channel estimation
in the network, and the MCS in each link is either fixed at
MCS 0, or varied under the link adaptation protocol. The sum
throughput results are shown in Fig. 4. Compared with the
performance of the baseline SDMA-MAC with ZF RX vectors,
the introduction of TX beamforming yields around 20%
improvement in terms of sum throughput. When compared
with the baseline SDMA-MAC plus MMSE RX vectors, the TX
beamforming design still yields more than 10% throughput
gain.

IV. Combined Performance Characterization 

After evaluating the impact of each of the four elements
introduced in this paper in isolation, we now compare
the performance of the baseline concurrent SDMA-MAC with
the variant that includes the following four elements: (a)
practical MMSE algorithm; (b) channel estimation errors; (c)
link adaptation mechanism; (d) incorporation of TX beamforming.
For comparison purposes, this section also introduces
results of a non-concurrent MAC (only one link is allowed
at any given time) that can employ any number of spatial
streams less than $N_A$. The non-concurrent MAC also employs
MIMO TX and RX beamforming and link adaptation, and is
subject to channel estimation errors. Detailed settings in these
MAC schemes are summarized in Table II.

A. Simulation Results 

The simulation results are summarized in Fig. 5. Firstly, the
baseline SDMA-MAC has the lowest sum throughput, which
is mostly due to its lack of link adaptation. Secondly, the
enhanced SDMA-MAC has 3x to 4x higher throughput than
the baseline SDMA-MAC, but its results are lower than those of
the non-concurrent MAC. This is because, under imperfect channel
estimation, residual interference among concurrent links
has a significant impact on the overall network performance.
This is partly the reason why the enhanced design with the
Universal MMSE scheme has the highest sum throughput.
With 4 concurrent links it shows a 500% improvement over
the baseline SDMA-MAC and 40% improvement over the
non-concurrent MAC scheme. The superior performance of
this enhanced SDMA-MAC over the non-concurrent MAC
scheme is primarily attributed to the use of multiple antennas
for spatial interference mitigation and the associated power
allocation.

At this juncture it is worth noting that in an actual fully 
functioning MAC, the overhead of the contention window is 
most likely highest for the enhanced scheme with the Universal 
MMSE, and is smallest for the non-concurrent MAC. This 
is because the Universal-MMSE method will require more 
control packets in order to get all the information needed. 

V. Conclusion 

The aim of this work was to investigate how an SDMA
MAC based on the notion of concurrent links would perform
in a real operating network that is subject to self-interference
and channel estimation errors, both of which negatively impact
performance. We also looked to bring in link adaptation and
MMSE based beamforming, which are part of almost any operating
MIMO based system. Our work uncovered two rather
important results. The first is that significant performance
improvements can be had with the combination of MMSE
based beamforming and link adaptation, even in the presence
of channel estimation errors. The second is that a single link
transmission strategy that can use multiple spatial streams is
rather hard to beat with a concurrent transmission strategy that
looks to maximize the number of transmissions, each with a
single spatial stream.



TABLE II
Detailed Settings in the Considered MAC Schemes

MAC Scheme | TX Vectors | RX Vectors | Link Adaptation (# of streams) | Link Adaptation (MCS per stream) | Channel Estimation Error
Baseline Concurrent SDMA-MAC | TX Beamnulling, Eqn. (3)-(4) | ZF method, Eqn. (8) | Each link uses only one stream. | The MCS per link is fixed as MCS 0. | $0.1\sigma_N^2$
Enhanced (realistic) SDMA-MAC Scheme | TX Beamforming, Section III.D | MMSE method, Eqn. (10)-(11) | Each link uses only one stream. | The MCS per link is adaptively selected (section III.C). | $0.1\sigma_N^2$
Enhanced Scheme with the Universal MMSE Scheme at the RX | TX Beamforming, Section III.D | Universal MMSE method, Eqn. (12)-(13) | Each link uses only one stream. | The MCS per link is adaptively selected (section III.C). | $0.1\sigma_N^2$
Non-concurrent MAC Scheme | SVD based method, see [12] | SVD based method, see [12] | Each link can use multiple streams. The number of streams is adaptively selected to maximize the throughput. | The MCS per stream is adaptively selected, which is similar to the method in section III.C. | $0.1\sigma_N^2$